A method of supervised learning from conflicting data with hidden contexts

Zhang, Tianren, Jiang, Yizhou, Chen, Feng

arXiv.org Artificial Intelligence

Conventional supervised learning assumes a stable input-output relationship. However, this assumption fails in open-ended training settings where the input-output relationship depends on hidden contexts. In this work, we formulate a more general supervised learning problem in which training data is drawn from multiple unobservable domains, each potentially exhibiting a distinct input-output map. This inherent conflict in the data renders standard empirical risk minimization ineffective. To address this challenge, we propose LEAF, a method that introduces an allocation function, which learns to assign conflicting data to different predictive models. We establish a connection between LEAF and a variant of the Expectation-Maximization algorithm, allowing us to derive an analytical expression for the allocation function. Finally, we provide a theoretical analysis of LEAF and empirically validate its effectiveness on both synthetic and real-world tasks involving conflicting data.
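
The allocation idea can be illustrated with a hard-assignment, EM-style loop. The sketch below is not the paper's implementation: the two linear predictive models, the squared-error allocation rule, and the alternating refit are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Conflicting data: two hidden contexts map the same inputs to
# different outputs (y = 2x in one domain, y = -x in the other).
x = rng.uniform(-1.0, 1.0, size=400)
ctx = rng.integers(0, 2, size=400)
y = np.where(ctx == 0, 2.0 * x, -x) + 0.05 * rng.standard_normal(400)

K = 2                                  # number of predictive models
w = rng.standard_normal(K)             # model k predicts y_hat = w[k] * x

for _ in range(20):
    # Allocation step (E-step stand-in): send each sample to the model
    # that currently explains it best.
    errs = (y[None, :] - w[:, None] * x[None, :]) ** 2   # shape (K, N)
    assign = errs.argmin(axis=0)
    # Refit step (M-step stand-in): least squares per allocated subset.
    for k in range(K):
        m = assign == k
        if m.any():
            w[k] = (x[m] @ y[m]) / (x[m] @ x[m])

print("recovered slopes:", np.sort(w))  # ~[-1.0, 2.0]
```

With slopes 2 and -1 mixed in the same dataset, a single least-squares fit would average the two maps away, while the alternating allocation recovers both.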


Efficient Neural Hybrid System Learning and Transition System Abstraction for Dynamical Systems

Yang, Yejiang, Mo, Zihao, Xiang, Weiming

arXiv.org Artificial Intelligence

This paper proposes a neural network hybrid modeling framework for dynamics learning that promotes interpretable, computationally efficient dynamics learning and system identification. First, a low-level model is trained to learn the system dynamics, using multiple simple neural networks to approximate the local dynamics on data-driven partitions. Then, based on the low-level model, a high-level model is trained that abstracts the low-level neural hybrid system model into a transition system, enabling Computation Tree Logic (CTL) verification to improve the model's support for human interaction and its verification efficiency. Keywords: Hybrid and Distributed System Modeling; Neural Networks; Nonlinear System Modeling; Maximum-Entropy Partitioning; Model Abstraction.
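
As a rough illustration of the two levels, the sketch below uses 1-D k-means as a stand-in for the paper's maximum-entropy partitioning, local affine models in place of the simple neural networks, and builds the abstract transition relation by pushing data through the learned dynamics; all names and parameter values are assumptions.

```python
import numpy as np

rng = np.random.default_rng(1)

# Trajectory pairs (x_t, x_{t+1}) from an unknown 1-D system.
f = lambda x: 0.9 * x + 0.3 * np.sin(3.0 * x)
xs = rng.uniform(-2.0, 2.0, size=2000)
ys = f(xs)

# Low level: partition the state space (1-D k-means stands in for
# maximum-entropy partitioning), one local affine model per cell.
K = 4
centers = np.linspace(-2.0, 2.0, K)
for _ in range(10):
    cell = np.abs(xs[:, None] - centers[None, :]).argmin(axis=1)
    centers = np.array([xs[cell == k].mean() for k in range(K)])

local = []                             # (slope, intercept) per cell
for k in range(K):
    A = np.stack([xs[cell == k], np.ones((cell == k).sum())], axis=1)
    local.append(np.linalg.lstsq(A, ys[cell == k], rcond=None)[0])

def step_hat(x):
    """Predict x_{t+1} with the local model of x's cell."""
    a, b = local[np.abs(x - centers).argmin()]
    return a * x + b

# High level: abstract into a transition system over the cells.
trans = {(np.abs(x0 - centers).argmin(),
          np.abs(step_hat(x0) - centers).argmin()) for x0 in xs}
print("abstract transitions:", sorted(trans))
```

CTL formulas (e.g., "every trajectory eventually reaches cell 0") could then be checked on the small transition relation rather than on the learned dynamics directly.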


InterpBench: Semi-Synthetic Transformers for Evaluating Mechanistic Interpretability Techniques

Gupta, Rohan, Arcuschin, Iván, Kwa, Thomas, Garriga-Alonso, Adrià

arXiv.org Artificial Intelligence

Mechanistic interpretability methods aim to identify the algorithm a neural network implements, but it is difficult to validate such methods when the true algorithm is unknown. This work presents InterpBench, a collection of semi-synthetic yet realistic transformers with known circuits for evaluating these techniques. We train these neural networks using a stricter version of Interchange Intervention Training (IIT) which we call Strict IIT (SIIT). Like the original, SIIT trains neural networks by aligning their internal computation with a desired high-level causal model, but it also prevents non-circuit nodes from affecting the model's output. We evaluate SIIT on sparse transformers produced by the Tracr tool and find that SIIT models maintain Tracr's original circuit while being more realistic. SIIT can also train transformers with larger circuits, like Indirect Object Identification (IOI). Finally, we use our benchmark to evaluate existing circuit discovery techniques.
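
The two training signals can be sketched on a toy network. Below, hidden unit 0 is (by assumption) aligned with a high-level variable S = a + b; the IIT term asks an interchange intervention on the aligned unit to reproduce the high-level model's behavior, and the SIIT term additionally penalizes any effect of patching a non-circuit unit. This is a hand-rolled illustration, not InterpBench's training code.

```python
import numpy as np

rng = np.random.default_rng(2)

# Toy low-level model: a linear 2-layer net with 3 hidden units.
# By assumption, hidden unit 0 is aligned with the high-level
# variable S = a + b; units 1 and 2 are non-circuit nodes.
W1 = rng.standard_normal((3, 2))
W2 = rng.standard_normal(3)

def forward(inp, patch=None):
    h = W1 @ inp
    if patch is not None:            # interchange intervention: overwrite
        idx, val = patch             # one hidden node with its value from
        h = h.copy(); h[idx] = val   # a run on a different (source) input
    return W2 @ h

def high_level(a, b):                # desired high-level causal model
    return a + b

base = np.array([1.0, 2.0])
source = np.array([4.0, -1.0])
h_source = W1 @ source

# IIT term: patching the aligned unit should make the model behave as
# the high-level model does when S is taken from the source input.
iit_loss = (forward(base, patch=(0, h_source[0])) - high_level(*source)) ** 2

# SIIT's extra term: patching a NON-circuit unit must leave the output
# unchanged, so non-circuit nodes cannot carry the computation.
siit_penalty = (forward(base, patch=(1, h_source[1])) - forward(base)) ** 2

print(float(iit_loss), float(siit_penalty))  # both driven toward 0 by training
```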


Targeted Reduction of Causal Models

Kekić, Armin, Schölkopf, Bernhard, Besserve, Michel

arXiv.org Machine Learning

Why does a phenomenon occur? Addressing this question is central to most scientific inquiries based on empirical observations, and often heavily relies on simulations of scientific models. As models become more intricate, deciphering the causes behind these phenomena in high-dimensional spaces of interconnected variables becomes increasingly challenging. Causal machine learning may assist scientists in the discovery of relevant and interpretable patterns of causation in simulations. We introduce Targeted Causal Reduction (TCR), a method for turning complex models into a concise set of causal factors that explain a specific target phenomenon. We derive an information theoretic objective to learn TCR from interventional data or simulations and propose algorithms to optimize this objective efficiently. TCR's ability to generate interpretable high-level explanations from complex models is demonstrated on toy and mechanical systems, illustrating its potential to assist scientists in the study of complex phenomena in a broad range of disciplines.
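
In the linear case the idea reduces to something very simple. The sketch below learns a one-dimensional reduction z = w·x from interventional data, with a squared-error consistency objective standing in for the paper's information-theoretic one; the simulator and the shift-style interventions are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(3)

d = 20
c = rng.standard_normal(d)             # ground-truth simulator weights

def simulate(shift):
    """High-dimensional simulator under a soft intervention `shift`."""
    x = rng.standard_normal(d) + shift
    y = c @ x + 0.1 * rng.standard_normal()   # scalar target phenomenon
    return x, y

# Interventional data: random shifts play the role of interventions.
data = [simulate(rng.standard_normal(d)) for _ in range(500)]
X = np.array([x for x, _ in data])
Y = np.array([y for _, y in data])

# Learn the reduction z = w @ x jointly with the (here trivial) reduced
# model y_hat = z; under squared error this collapses to least squares,
# a stand-in for the paper's information-theoretic objective.
w, *_ = np.linalg.lstsq(X, Y, rcond=None)

# The 20 interacting variables are compressed into one causal factor
# that still predicts the target under a new intervention.
x_new, y_new = simulate(np.ones(d))
print("reduced-model prediction:", w @ x_new, "observed target:", y_new)
```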


EnsembleFollower: A Hybrid Car-Following Framework Based On Reinforcement Learning and Hierarchical Planning

Han, Xu, Chen, Xianda, Zhu, Meixin, Cai, Pinlong, Zhou, Jianshan, Chu, Xiaowen

arXiv.org Artificial Intelligence

Car-following models have made significant contributions to our understanding of longitudinal driving behavior. However, they often exhibit limited accuracy and flexibility: they cannot fully capture the complexity inherent in car-following processes, and they may falter in unseen scenarios because they rely on the limited driving behaviors present in their training data. Notably, each car-following model has its own strengths and weaknesses depending on the driving scenario. We therefore propose EnsembleFollower, a hierarchical planning framework for advanced human-like car-following. The framework involves a high-level Reinforcement Learning-based agent that manages multiple low-level car-following models according to the current state, either by selecting an appropriate low-level model to perform an action or by allocating different weights across all low-level components. Moreover, we propose a jerk-constrained kinematic model for more realistic car-following simulations. We evaluate the proposed method on real-world driving data from the HighD dataset. The experimental results show that EnsembleFollower reproduces human-like behavior more accurately and combines its constituent models effectively, demonstrating that the framework can handle diverse car-following conditions by leveraging the strengths of its low-level models.
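
The hierarchy can be sketched with two classical low-level models and a weighted blend. Here the Intelligent Driver Model and a simple gap-keeping controller stand in for the model pool, fixed weights stand in for the RL agent's output, and the jerk constraint is applied by rate-limiting the commanded acceleration; all parameter values are illustrative.

```python
import numpy as np

def idm_accel(gap, v, dv, v0=30.0, T=1.5, a_max=1.5, b=2.0, s0=2.0):
    """Intelligent Driver Model: one possible low-level model.
    dv is the closing speed (follower speed minus leader speed)."""
    s_star = s0 + v * T + v * dv / (2.0 * np.sqrt(a_max * b))
    return a_max * (1.0 - (v / v0) ** 4 - (s_star / gap) ** 2)

def gap_keep_accel(gap, v, dv, T=1.2, k=0.5, kd=0.8):
    """A simple gap-keeping controller: a second low-level model."""
    return k * (gap - v * T) - kd * dv

def follow_accel(gap, v, dv, weights, prev_a=0.0, dt=0.1, jerk_max=3.0):
    """High-level blend of the low-level models. A trained RL agent
    would output `weights` from the current state; the jerk constraint
    rate-limits the commanded acceleration between time steps."""
    a_cmd = (weights[0] * idm_accel(gap, v, dv)
             + weights[1] * gap_keep_accel(gap, v, dv))
    return float(np.clip(a_cmd, prev_a - jerk_max * dt, prev_a + jerk_max * dt))

print(follow_accel(gap=25.0, v=20.0, dv=1.0, weights=(0.7, 0.3)))
```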


Abstracting Causal Models

Beckers, Sander, Halpern, Joseph Y.

arXiv.org Artificial Intelligence

We consider a sequence of successively more restrictive definitions of abstraction for causal models. We start with a notion introduced by Rubenstein et al. (2017) called exact transformation, which applies to probabilistic causal models. We then move to uniform transformation, which applies to deterministic causal models and does not allow differences to be hidden by the "right" choice of distribution; to abstraction, where the interventions of interest are determined by the map from low-level states to high-level states; and finally to strong abstraction, which takes more seriously all potential interventions in a model, not just the allowed interventions. We show that procedures for combining micro-variables into macro-variables are instances of our notion of strong abstraction, as are all the examples considered by Rubenstein et al.
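
A minimal instance of the micro-to-macro case: two binary micro-variables are aggregated by OR into one macro-variable, and the abstraction check asks that intervening and then abstracting commutes with abstracting and then intervening, for every allowed intervention and every context. The variable names and the OR aggregation are assumptions for illustration.

```python
from itertools import product

def micro(u, do=None):
    """Micro model: A = u_a, B = u_b, C = A or B (unless intervened)."""
    do = do or {}
    a = do.get("A", u[0])
    b = do.get("B", u[1])
    c = do.get("C", int(a or b))
    return (a, b, c)

def macro(u, do=None):
    """Macro model: S aggregates (A, B) by OR; D mirrors C."""
    do = do or {}
    s = do.get("S", int(u[0] or u[1]))
    d = do.get("D", s)
    return (s, d)

# tau: the map from low-level states to high-level states.
tau = lambda st: (int(st[0] or st[1]), st[2])

# omega: the allowed micro interventions and their macro counterparts.
omega = [({}, {}),
         ({"A": 1, "B": 0}, {"S": 1}),
         ({"A": 0, "B": 0}, {"S": 0}),
         ({"C": 1}, {"D": 1})]

# Abstraction check: intervene-then-abstract must equal
# abstract-then-intervene for every context u and allowed intervention.
ok = all(tau(micro(u, lo)) == macro(u, hi)
         for u in product([0, 1], repeat=2)
         for lo, hi in omega)
print("abstraction holds:", ok)
```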